Tesseract Ocr: a Case Study for License Plate Recognition in Brazil

نویسندگان

  • Dalton Matsuo Tavares
  • Glauco Augusto de Paula Caurin
  • Adilson Gonzaga
چکیده

This paper presents the analysis of Google’s Tesseract OCR for license plate recognition in Brazil. The performance results presented for Tesseract OCR will be compared to market grade OCR products known here as “A” and “B”. This is a necessary measure due to a confidentiality agreement with the company supporting this research. The use of OpenCV is also considered due to limitations inherent to Tesseract OCR.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optical Character Recognition by Open source OCR Tool Tesseract: A Case Study

Optical character recognition (OCR) method has been used in converting printed text into editable text. OCR is very useful and popular method in various applications. Accuracy of OCR can be dependent on text preprocessing and segmentation algorithms. Sometimes it is difficult to retrieve text from the image because of different size, style, orientation, complex background of image etc. We begin...

متن کامل

Recognition of Handwritten Roman Script Using Tesseract Open source OCR Engine

In the present work, we have used Tesseract 2.01 open source Optical Character Recognition (OCR) Engine under Apache License 2.0 for recognition of handwriting samples of lower case Roman script. Handwritten isolated and free-flow text samples were collected from multiple users. Tesseract is trained to recognize user-specific handwriting samples of both the categories of document pages. On a si...

متن کامل

Recognition of Handwritten Textual Annotations using Tesseract Open Source OCR Engine for information Just In Time (iJIT)

Objective of the current work is to develop an Optical Character Recognition (OCR) engine for information Just In Time (iJIT) system that can be used for recognition of handwritten textual annotations of lower case Roman script. Tesseract open source OCR engine under Apache License 2.0 is used to develop user-specific handwriting recognition models, viz., the language sets, for the said system,...

متن کامل

Lucrative Method for License Plate Recognition

Recent research initiatives have addressed the need for improved performance of license plate recognition accuracy that would profit many applications, ITS in particular. Different image processing techniques have been implemented for this purpose specifically edge detection, binarization, segmentation algorithm and tesseract. Each of these steps has its own strengths and weaknesses and it has ...

متن کامل

Comparison of Visual and Logical Character Segmentation in Tesseract OCR Language Data for Indic Writing Scripts

Language data for the Tesseract OCR system currently supports recognition of a number of languages written in Indic writing scripts. An initial study is described to create comparable data for Tesseract training and evaluation based on two approaches to character segmentation of Indic scripts; logical vs. visual. Results indicate further investigation of visual based character segmentation lang...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010